Interpretability and Unification

نویسندگان

چکیده

In a recent reply to our article, “What is Interpretability?,” Prasetya argues against position that artificial neural networks are explainable. It claimed indefeasibility thesis—that adding complexity an explanation of phenomenon does not make the any less explainable—is false. More precisely, unificationist explanations defeasible increasing complexity, and thus, we may be able provide such highly complex AI models. The highlights important lacuna in original paper, omission account explanation, affords us opportunity respond. Here, argue explainable way should satisfy unificationists interpretability methods present ways which ML theories can achieve unification.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phases and Interpretability

We adopt a theory of relativisation based on the idea that relatives, like wh-constructions in the analysis of Chomsky (1998), require two sorts of features to construct their LF-interpretation. We argue that it is the variable interpetability of these features that gives rise to different syntactic patterns. We use this theory to provide an explanation for some curious syntactic facts found in...

متن کامل

Unification and Default Unification

Unification of (typed) feature structures is an important tool for capturing linguistic generalizations. It is the core of a family of linguistic formalisms, it has been applied in phonology, morphology, syntax, and semantics, and is dominant in most linguistically informed work in natural language processing. Unification monotonically merges information. Default unification provides a non-mono...

متن کامل

Interpretability Logic

This is an overview a study of interpretability logic in Zagreb for the last twenty years: a brief history and some planes for further research. The idea of treating a provability predicate as a modal operator goes back to Gödel. The same idea was taken up later by Kripke and Montague, but only in the mid–seventies was the correct choice of axioms, based on Löb’s theorem, seriously considered b...

متن کامل

Computable Functors and Effective interpretability

Our main result is the equivalence of two notions of reducibility between structures. One is a syntactical notion which is an effective version of interpretability as in model theory, and the other one is a computational notion which is a strengthening of the wellknown Medvedev reducibility. We extend our result to effective biinterpretability and also to effective reductions between classes of...

متن کامل

Manipulating and Measuring Model Interpretability

Despite a growing body of research focused on creating interpretable machine learning methods, there have been few empirical studies verifying whether interpretable methods achieve their intended effects on end users. We present a framework for assessing the effects of model interpretability on users via pre-registered experiments in which participants are shown functionally identical models th...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Philosophy & Technology

سال: 2022

ISSN: ['2210-5441', '2210-5433']

DOI: https://doi.org/10.1007/s13347-022-00537-z